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"METHOD OF BUILDING A VARIABLE-LENGTH ERROR CODE" 

FIELD OF THE INVENTION 

The present Invention relates to a method of building a variable length error 
code, said method comprising the steps of : 
5 (1) initializing the needed parameters : minimum and maximum length of 

codewords L x and Lmax respectively, free distance d rree between each codeword (said 
distance df** being for a VLEC code C the minimum Hamming distance in the set of all 
arbitrary extended codes), required number of codewords S ; 

(2) generating a fixed length code C of length L x and minimal distance tw 
10 with bmin = min {b k ; k « 1, 2, , R}, b k = the distance associated to the codeworcl 

length L k of code C and defined as the minimum Hamming distance between all codewords 
of C with length 1*, and R = the number of different codeword lengths in C, said 
generating step creating a set W of n-bit long words distant of d ; 

(3) storing in the set W ail the possible L x - tuples distant of d mIn from the 
15 codewords of C (said distance d mIn for a VLEC code C being the minimum value of all the 

diverging distances between all possible couples of different-length codewords of Q, and, 
if said set W is not empty, affixing at the end of all words one extra bit, said storing step 
replacing the set W by a new one having twice more words than the previous one and the 
length of each one of these words being L t + 1 ; 
20 (4) deleting all the words of the set W that do not satisfy the c^m distance 

with all codewords of C, said distance c^n being the minimum converging distance of the 
codeC; 

(5) in the case where no word is found or the maximum number of bits is 
reached, reducing the constraint of distance for finding more words ; 

25 (6) controlling that all words of the set W are distant of b^ the found words 

being then added to the code C ; 

(7) if the required number of codewords has not been reached, repeating the 
steps (1) to (6) until the method finds either no further possibility to continue or the 
required number of codewords ; 

30 (8) if the number of codewords of C is greater than S, calculating, on the basis 

of the structure of the VLEC code, the average length AL obtained by weighting each 
codeword length with the probability of the source, said AL becoming the AUn, n if it is lower 
than ALm In , with ALmm = the minimum value of AL, and the corresponding code structure 
being kept in memory. 

35 The invention also relates to a corresponding device. 



BACKGROUND OF THE INVENTION 




signals coming from a source, a source coder 1 followed by a channel coder 2 and, after 
the transmission of the coded signals thus obtained through a channel 3, a channel 
decoder 4 and a source decoder 5 (the decoded signals are intended to be sent towards a 
receiver). Variable-length codes (VLC) are classically used in source coding for their 
compression capabilities, and the associated channel coding techniques combat the effects 
of the real transmission channel (such as fading, noise, etc.). However, since source ceding 
is intended to remove redundancy and channel coding to re-introduce it, it has been 
investigated how to efficiently coordinate these techniques in order to improve the overall 
system while keeping the complexity at an acceptable level. 

Among the solutions proposed in such an approach, the variable-length error 
correcting (VLEQ codes present the advantage to be variable-length while providing error 
correction capabilities, but building these codes is rather time consuming for short 
alphabets (and become even prohibitive for higher length alphabets sources), and the 
construction complexity is also a drawback, as it will be seen. 

First, some definitions and properties of the classical VLC must be recalled. A 
cade C is a set of S codewords {c u Q>, c 3 ,.„, q,.., Cs}, for each of which a length «, = |q|. is 

defined, with h < l 2 < t 3 £ ^ ^ k without any loss of generality. The number of 

different codeword lengths In the code C is called R, with obviously R < S, and these . 
lengths are denoted as U, L* L 3 , ,L„ U, with U < L 2 < L 3 < < U- A variable- 
length code, or VLC, is then the structure denoted by (si@ Li, s 2 @ L 2 , s 3 @ L 3 , , s R @ 

U), which corresponds to s t codewords of length L*, S2 codewords of length L 2 , s 3 

codewords of length L3, , and s R codewords of length L R . When using a VLC, the 

compression efficiency, for a given source, is related to the number of bits necessary to 
transmit symbols from said source. The measure used to estimate this efficiency is often 
the average length AL of the code (i.e. the average number of bits needed to transmit a 
word), given, when each symbol a» is mapped to the codeword q, by the following relation 



where, for a data source A, the S source symbols are denoted by {ai, a^ a 3 , , a s } and 

P(a,) is the respective probability of occurrence of each of these symbols, with 2 P(ai) = 1 
(from i = 1 to i « S). If AUfin denotes the minimal value for the average length AL, it is 



(1): 



AL-J^.PCa,) 



(1) 



which is equivalent to the relation (2) : 




(2) 
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easy to see that when AUm is reached, the symbols are indexed in such a way that P(aO ^ 
P(aa) > P(a 3 ) >...£ P(a,) S ...P(a s ). In order to encode the data in such a way that the 
receiver can decode the coded information, the VLC must satisfy the following properties : 
to be non-sfngular (all the codewords are distinct, i.e. no more than one source symbol is 
allocated to one codeword) and to be uniquely decodable (i.e. it is possible to map any 
string of codewords unambiguously back to-the correct source symbols, without any error). 

An introductton and a presentation of different distances that are useful when 
reviewing some general properties of the VLC codes will then help to recall the notion of 
error-correcting property used in the VLEC code theory : 

(a) Hamming weight and distance : if w Is a word of length n with w = (wj, 
W2,..., w„), the Hamming weight of w, or simply weighty is the number W(w) of non-zero 
symbols In w : 

w ( w )=£ A (3) 

and, if Wjt and w 2 are two words of equal length n with w, = (wn, wq, Wq,...., w, n ) and i - 1 
or 2, the Hamming distance (or, simply, distance) between w x and w 2 is the number of 
positions in which Wj and w 2 differ (for example, for the binary case, it is easy to see that : 
H(Wi, w 2 ) = W(wi + w 2 ) (4) 

where the addition Is modulo-2) However, the Hamming distance is by definition restricted 
to fixed-length codes, and other definitions will be defined before considering VLEC codes. 

(b) let = wj w 2 .... w* be a concatenation of n words of a VLEC code C, 
then the set Fm - {f, : |f,| = N} is called the.extended code of C of order N. 

(c) minimum block distance and overall minimum block distance : the 
minimum block distance bk associated to the codeword length L* of a VLEC code C is 
defined as the minimum Hamming distance between all distinct codewords of C with the 
same length t* : 

bk = min {H(c c,) : q, q e C, I *j, |q| m |q| =U} for k = 1,..., R (5) 
and the overall minimum block distance bmm of said VLEC code C, which is the minimum 
block distance value for every possible length L*, Is defined by : 

b m = min {b k : k - 1,... R> (6) 

(d) diverging distance and minimum diverging distance : the diverging 
distance between two codewords of different length q = x t , x i2 ....x^ and 

9) = Xj! x J2 .... x J<( of a VLEC code C, where q, q e C, t, = |q| and fj - |q| with i, > l u is 
defined by : 

D (C U Cj) = H(X t Xfc .... Xi,. , x h x fe x }e . ) (7) 

i.e. It is also the Hamming distance between a e, - length codeword and the ij - length 
prefix of a longer codeword, and the minimum diverging distance d^,, of said VLEC code C 
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is the minimum value of all the diverging distances between all possible couples of 
codewords of C of unequal length : 

dmin = mln { D(c { tCj ) : cj.cj 6 C,|ci| * | Cj | } (8) 

(e) converging distance and minimum converging distance : the converging 
distance between two codewords of different length q « x i2 ....x^ f and 

q » x j2 .... x^ of a VLEC code C, where |q| = t| > |q|= i jf is defined by : 

C(q, q) = H ( x,^ x l<f . £j+2 ... jc^ ,~x h x te ...jc u J ) (9) 

i.e. it is also the Hamming distance between a «j - length codeword and the % - length suffix 
of a longer codeword, and the minimum converging distance of said VLEC code C is the 
minimum value of all the converging distances between all possible couples of C of unequal 
length : 

c m m — mln {C(Cj ,Cj) !.Cj,Cj e C, |c. | * |c J } (10) 

(0 free distance : the free distance d^ of a code is the minimum Hamming 
distance in the set of all arbitrary long paths that diverge from some common state Stand 
converge again in another common state Sj, with j > i : 

d free = min {H(f,, f,) : ft, fj e F N/ N = 1, 2,...., «> } (11) 

Following the structure model used for a VLC, it is therefore possible to 
describe the structure of the VLEC code C by the notation : 

Si @ L lf b t ; S 2 @ L 2r b 2 ; .... ; Sr @ Lr, b R ; d^n, Qmn (12) 
where there are S\ codewords of length L| with minimum block distance b|, for all i = 1, 2,... 
R, (it is recalled that R is the number of different codeword lengths) and minimum 
diverging and converging distances d m)n and c mi h.The most important parameter of a VLEC - 
code is its free distance dfree/ which influences greatly its performance in terms of error- 
correcting capabilities, and it can be shown that the free distance of a VLEC code is 
bounded by : 

dfree * nriin (b m m, dmin + Onto) (13) 

These definitions being recalled, the state-of-the-art in VLEC codes 
construction will be now described more easily. The first types of VLEC codes, called a - 
prompt codes and introduced in 1974, and an extension of this family, called a tl> t 2 ^.,t R - 

prompt codes, have both the same essential property : if one denotes by a(q) the set of 
words that are closer to q than to any codeword Cj, with j * i, no sequence in a(q) is a 
prefix of a sequence in another a(q) . The construction of these codes is very simple, and 
the construction algorithm is adjustable by the number of codewords at each length, which 
makes possible to find the best prompt code for a given source and a given d^ee. However, 
this best code performs poorly in terms of compression performance. 
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A more recent construction, allowing the construction of a VLEC code from the 
generator matrix of a fixed-length linear block code, was proposed in the document 
"Variable-length error-correcting codes" by V.Buttigleg, Ph.D.Thesis, University of 
Manchester, England, 1995. Called code-anticode construction, this algorithm relies on line 
combinations and column permutations to form an anticode at the rightmost column. Once 
the code-anticode generator matrix is obtained, the VLEC code is simply obtained by a 
matrix multiplication. 

This technique has however several drawbacks. First, there is no explicit 
method to find the needed line combinations and column permutations to obtain the 
anticode. Moreover, the construction does not take into account the source statistics and, 
consequently, often reveals itself sub-optimal (one can find a code with smaller average 
length by a post-processing on the VLEC code). In the same document, the author has 
then proposed an improved method, called Heuristic method, that Is based on a computer 
search for building a VLEC code giving the better known compression rate for a specified 
source and a given protection against errors, i.e. a code C with specified overall minimum 
block, diverging and converging distances (and hence a minimum value for d^e) and with 
codeword lengths matched to the source statistics so as to obtain a minimum average 
codeword length for the chosen free distance and the specified source (in practice, one 
takes : b mIn = d min +c m]n ^ d^, and : d m m = [d free /23. 

The main steps of this Heuristic method, which uses the following 
parameters : minimum length L x of codewords, maximum length L^ of codewords, free 
distance d^ between each codeword, number S of codewords required, are now described 
with reference to the flowcharts of Figs 2 to 4. 

To start the computer search ("Start"), all the needed parameters must be 
first specified : Lt (the minimum codeword length, which must be at least equal to or 
greater than the minimum diverging distance required), L^ (the maximum codeword 
length), the different distances between codewords (d^e, b m(n , d min , q^), and S (the 
number of codewords required by the given source), and some relations are set when 
choosing these parameters : 

Lt * d m!n 

• bmIn — dfree 
dmin + Qni n = d frea 

The first phase of the algorithm, referenced 11, is then performed : It consists 
in the generation of a fixed length code (put Initially in C) of length Li and minimal distance 
bmin, with a maximum number of codewords. This phase is in fact an Initialization, 
performed for Instance by means of an algorithm such as the greedy algorithm (GA), 
presented in Rg.5, or the majority voting algorithm (MVA), presented in FIg.7, or a new 
proposed variation, denoted by GAS (Greedy Algorithm by Step), which consists in a 
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variation of the two above mentioned ones. The GAS consists in the search method used in 
the GA, where instead of deleting half of the codewords, only the last codeword of the 
group is deleted. These two algorithms are useful to create a set W of n-bit long words 
distant of d (in practice, it may be noted that the MVA finds more words than the GA, but it 
asks too much time for only a small improvement of the compression capacity, as shown in 
the tables of Figs 6 and 8, which compare, respectively for the GA and for the MVA, the 
best code structures obtained with different values of dfree for the 26-symbol English source 
defined in the table of Rg.9." 

The second phase of the algorithm, corresponding to the elements referenced 
21 to 24 (21+22 « operation AO ; 23+24 = operation A2) in Rg.2, consists in listing and 
storing (step 21) in a set called W all the possible U - tuples at the distance of d m]n from 
the codewords in C. If dmm £ b m i n , then W is empty. If this set W of ail the words satisfying 
the minimum diverging distance to the current code is not empty (reply NO to the test 22 
: | W| « 0 ?), the number of words in W is doubled by increasing the length of the words by 

one bit by affixing first a "0" and then a "1" to the rightmost position of all the words in W 
(step 24), except if the maximum number of bits is exceeded (reply YES to the test 23). At 
the output of said step 24, this modified set W has twice more words than the previous W, 
and the length of each one is L* + 1. 

The third phase of the algorithm, corresponding to the elements 31 to 35 ( = 
operation A3 In Rg,2), consists in deleting (step 31) all the words of set W that do not 
satisfy the Cmfn distance (minimum converging distance) with all the codewords of C (i.e. in 
storing in a new W only the words which satisfy said minimum converging distance, the 
other ones being discarded). At this point, the new set W is a set of words which, when 
compared to the codewords of C, satisfy the required minimum diverging and converging 
distances (both dmm and distances) with the codewords of C. If that new set W is not 
empty (reply NO to the test 32 : |w| = 0 ?) one selects in W (step 33) the maximum 

number of words to satisfy the minimum block distance, In order to ensure that all the 
words of the set W, being of the same length, have a minimum distance at least equal to 
b m i n . At the end of this step 33, realized with the GA or the MVA (note that in this case, the 
initial set used for the GA or the MVA is the current W and not a n-tuples set), the words 
thus obtained are added (step 34) to the codewords already in C. 

If no word is found (i.e. W is empty) at the end of the step 21 (reply YES to 
the test 22 : |w| = 0 ?) or if the maximum number of bits is reached or exceeded (reply 

YES to the test 23), one enters the fourth part of the algorithm (steps 41 to 46, illustrated 
in Rg.3), which is used in order to unjam the process by inserting more liberty of choice, 
more particularly by affixing to all words in W extra bits (possibly several bits at the same 
time) so that the new group contains more bits than the old one. If there are enough 
codewords in the last group, some of them are deleted from this said group (as described 
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above), such deletions allowing to reduce the distance constraint and to find more 
codewords than before. As a matter of fact, the classical Heuristic method thus described 
begins with the maximum of codewords with the short length, maps them with the high 
probability symbols and tries to obtain a good compression rate, but sometimes the size of 
the small lengths sets are incompatible with the required number of codewords S. In this 
optic, easing a few codewords provides more freedom degrees and allows to reach a 
position where the initial requirements on distance and number of symbols for the code 
can be met. This deletion process is repeated until it remains a maximum of one codeword 
for each length. If W is empty at the end of the step 31 (reply YES to the test 32 : Jw| « 0 
?), the steps 23, 24, 31, 32 are repeated. If the required number of codewords has not 
been reached (reply NO to the test 35 provided at the end of this third phase), the steps 
21 to 24 and 31 to 35 must be repeated until said steps find that either there are no 
further possible words to be found or the required number of codewords is reached. 

If said required number of codewords has been reached (i.e. the number of 
codewords of C is equal to or greater than S (reply YES to the test 35), the structure of the 
VLEC code thus obtained is used in a fifth part, including the steps 51 to 56 (illustrated in 
Rg.4), in order to calculate the average length AL This is done by weighting each 
codeword length with the probability of the source, and comparing It to the current best 
one. If said average length AL of this VLEC code is lower than the minimized value of AL (= 
AUin)/ this AL becomes the AUm, and this new AL value and the corresponding code 
structure are kept in memory (step 51). These steps 51 and following (fifth part ; operation 
A4) allow to come back, within the algorithm, towards previous groups (i.e. to smaller 
word lengths), whereas the other parts of said algorithm are always performed on the 
current group or generating new ones. The stepsize for such a feedback operation is one, 
i.e. this feedback action can be considered as exhaustive. 

To continue this search of the best VLEC code, It is necessary to avoid keeping 
the same structure, which would lead to a loop in the algorithm. The last added group of 
the current code is deleted (steps 52, 53), the deletion of shorter length codewords 
allowing to find more longer length codewords, and some codewords (half the amount for 
the GVA ; the "best" one for the MVA) of the previous group are deleted (step 55), in order 
to re-loop (step 56) the algorithm at the beginning of the step 21 and find different VLEC 
structures (the number of deleted codewords depends on which method is used for 
selecting the words : if the GA method is used and one wants to obtain a linear code, It Is 
necessary to delete half of the codewords, while with the MVA method only one codeword, 
the best one, Is deleted, i.e. the one that allows to find the more codewords In the next 
group). 

However, the Heuristic method thus described often considers very unlikely 
code structures or proceeds with such a care (in order not to miss anything) that a great 
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complexity is observed in the implementation of said method, which moreover is rather 
time consuming and can even become prohibitive. It has therefore been proposed, in a 
European patent application filed on October 23, 2002, with the filing number 02292624.0 
(PHFR020110), an improved construction method with which it is possible to gain in 
complexity by avoiding these drawbacks, said method of building a variable length error 
code comprising, more precisely, the steps of : 

(1) initializing the needed parameters : minimum and maximum length of 
rodewbrds Li arid Una respectively, "free distance d^e betweeheach codeword (said 
distance d free being for a VLEC code C the minimum Hamming distance in the set of all 
arbitrary extended codes), required number of codewords S ; 

(2) generating (step 11) a fixed length code C of length U and minimal 

distance tw with b m]n = min {b k ; k = 1,2, , R}, b k = the distance associated to the 

codeword length U of code C and defined as the minimum Hamming distance between ail 
codewords of C with length U, and R = the number of different codeword lengths in C, 
said generating' step 11 creating a set W of n-bit long words distant of d ; 

(3) listing and storing (step 21) in the set W all the possible U - tuples at the 
distance of d mln from the codewords of C (said distance d mm for a VLEC code C being the 
minimum value of all the diverging distances between all possible couples of different- 
length codewords of C), and, if said set W is not empty, doubling the number of words in 
W by affixing at the end of all words one extra bit, said storing step therefore replacing the 
set W by a new one having twice more words^than the previous one and the length of each 
one of these words being U + 1 ; 

(4) deleting (step 31) all the words of the set W that do not satisfy the c^n 
distance with all codewords of C, said distance Cmin being the minimum converging distance 
of the code C; 

(5) in the case where no word is found or the maximum number of bits is 
reached, reducing (step 41) the constraint of distance for finding more words ; 

(6) controlling that all words of the set W are distant of b mllt , the found words 
being then added to the code C (step 34) ; 

(7) if (step 35) the required number of codewords has not been reached, 
repeating the steps (1) to (6) (i.e. the steps 21 to 35) until the method finds either no 
further possibility to continue or the required number of codewords ; 

(8) if the number of codewords of C is greater than S, calculating (operation 
A4), on the basis of the structure of the VLEC code, the average length AL obtained by 
weighting each codeword length with the probability of the source, said AL becoming the 
AUin/ If It is lower than AUm, with AUm = the minimum value of AL, and the corresponding 
code structure being kept in memory ; 

said building method being moreover such that at most one bit is added at the end of each 
word of the set W. 



9 

PHFR030031 EPp 

Simulations show that, with the classical Heuristic method, almost none of the 
obtained best codes has a hole (I.e. a length jump In Its structure length). It Is then 
considered, In the previously cited European patent application, that most good codes do 
not have jump of length and, therefore, that the set of examined VLEC codes can be 
reduced accordingly (which reduces the simulation time and the complexity of 
Implementation of the method, without modifying much the AL). Following this hypothesis, 
the method has been, according to said European patent application, modified by avoiding 
to add more than one bit at the end of each word of the set W. The corresponding 
implementation (improved Heuristic construction method, also called "noHole optimization" 
method) is illustrated in Figs 10 and 11, which show the two parts of a flowchart 
corresponding to said method (the elements that are identical to the ones observed in 
Figs.2 to 4 being designated with the same references). With respect to the flowchart of 
Figs.2 to 4, the parts that, with respect to the classical Heuristic technique, are useless for 
the implementation of the improved method have been cancelled : 

(a) if W is empty at the end of the step 31 (reply YES to the test 32 : |w| - 0 ?), the 
next phase Is now (see Fig. 10) not the repetition of the steps (23, 24, 31, 32), but the 
establishment (in place of said repetition) of a direct connection 91 towards the input of 
the circuit carrying out the operation 55 (deletion of some codewords, or of the best one, 
before a repetition of the steps 21 to 24 and 31 to 35), said operation 55 being then, as 
previously, followed by the operations 21 and following. 

(b) the fourth phase of the method is now reduced to one step, the operation 41, 
which is the test "Number of codewords in last group « 1 T . If the reply is NO, a direct 
link is established with the Input of the step 55 (connection 91), in view of carrying out said 
operation 55, and then the operations 21 and following. If the reply is YES, a connection 92 
is established with the input of the set of operations 52 to 54. 

The results thus obtained are presented in the table of Fig.12 for the 26 
symbol English source when using the GAS method for selecting codewords. It can be 
seen, when comparing with results presented in Fig.13, that although the result Is not 
completely optimal for dfr ee = 3 (the code structure has a hole at length L ^ 11), the AL rise 
is really acceptable when one considers that there is both strictly no degradation for the 
other d^ values and a gain of time between 2,5 and 4. The same remarks can be applied 
when comparing the present solution with the ones obtained In Fig.7, where the MVA 
complexity effect is clear. Similarly, applying the noHole optimisation with the GA method 
for selecting codewords leads to a time gain at the only expense of a slight AL rise for 
dfrce=3. Finally, Rg.5 shows on the other hand that the current solution offers better ALfor 
an acceptable gain of time, the noHole optimisation compensating almost entirely the 
complexity Induced by the GAS. 
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However, with the method thus described In said European patent application, 
there are cases where there are too many small length codewords in the generated VLEC 
code. It has then been proposed, in another European patent application filed on March 11, 
2003, with the filing number 03290604,2 (PHFR030026), another improved building 
method according to which the group deletion is not only performed with the last obtained 
codewords group, but more generally with groups up to a given length value group, in 
order to make possible to go back directly, and therefore very quickly, to smaller lengths, 
"i.e. toskip many Algorithm steps in cases where there are loo many smair length 
codewords. More precisely, denoting by U (with s for : skip) the length to which the 
algorithm will skip back to in the codeword deletion stage, it is proposed to skip parts of 
the original Heuristic algorithm by carefully jumping to lower lengths when looking for 
codewords to be deleted (however, when the considered codewords group length L is 
smaller than a preset value U, it is obviously better to apply the previous method, and the 
deletion is then done within the group of length L). The length comprised between U and 
U are consequently called "free lengths", i.e. lengths with a freedom degree, as they are 
decremented one by one in the search process (when the number of free lengths grows 
up, the simulation time also increased, exponentially). This method, called "U optimization 
adding", is depicted in the flowchart formed by the association of Fig.10 (unchanged part . 
of the previous method, the so-called noHole optimization method) and Fig. 14 (modified 
part of the noHole optimization method). 

Said Fig.14 is adapted from Fig.ll according to the following indications. The 
last added group of the current code is deleted, but only if (test 61) the codeword length of 
this previous group is (reply NO to the test 61) lower than or equal to U (the steps that 
follow the test 61 are then the same as previously : steps 53, 54, and 55 or 52 at the 
output of the test 54). If said codeword length is greater than U (reply YES to the test 61), 
an additional step 62 is provided for going to group with U-bit long codewords and deleting 
all groups with more than U bits. At the output of the step 62, the same steps 54, 55 as 
previously are provided. In practice, simulation results show that good compression rates 
can be obtained for U < L(max), where L(max) is the maximal authorized codeword length 
(it can be noted that increasing the value of U results in an improvement of the AL value 
until a constant floor - the best value - is reached, and this behaviour then suggests a 
possible dynamic choice of U, starting with U = U and incrementing it until said floor is 
reached). It seems however that, unfortunately, said U optimization method may be not 
consequent enough to decrease the computation time in all situations. 

SUMMARY OF THE INVENTION 

It is therefore an object of the invention to propose an improved construction 
method with which this drawback is avoided and better codes can be obtained, with an 
acceptable computation time. 
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To this end, the Invention relates to a method such as defined in the 
Introductory part of the description and which Is moreover characterized in that, 
considering that all distributions of number of codewords for the best VLEC codes have a 
similar curve allure of a bell shape type, it is defined an optimal length value Lm until which 
the number of codewords increases with their length, whereas it decreases after said value 
Lm, said definition allowing to apply the so-called "Ls optimization" method with avoiding 
the edges of the curve and to work locally. 

According to a possible improved implementation, the invention relates to a 
similar method (i.e. such as defined in the introductory part of the description), but which 
is now preferably characterized in that the deletion is realized not only in the last obtained 
group but also in the group of a given length value, in order to go back very quickly to 
smaller lengths, and, considering that all distributions of number of codewords for the best 
VLEC codes have a similar curve allure of a bell shape type, it is defined an optimal length 
value Lm until which the number of codewords increases with their length, whereas it 
decreases after said value Lm, said definition allowing to apply the so-called "Ls 
optimization" method with avoiding the edges of the curve and to work locally. 

It is also an object of the invention to propose a device for carrying out said 
construction method. 

To this end, the invention relates to a device for carrying out a variable length 
error code building method according to anyone of the two solutions thus proposed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will now be described, by way of example, with 
reference to the accompanying drawings in which : 

- Fig.l depicts a conventional communication channel ; 

- Figs. 2 to 4 are the three parts of a single flowchart illustrating the main 
steps of a conventional method used for building a VLEC code (and called Heuristic 
method) ; 

- Fig.5 illustrates an algorithm (called greedy algorithm, or GA) used for the 
initialization of the method of Figs. 2 to 4, and Fig.6 is a table giving various VLEC codes 
for a source constructed with the Heuristic construction using said algorithm of Fig.5 ; 

- Fig.7 illustrates another algorithm (called majority voting algorithm, or MVA) 
used for the initialization of the method of Figs. 2 to 4, and Fig.8 is another table giving 
various VLEC codes for a source constructed with the Heuristic construction using said 
algorithm of Fig.7 ; 

- Fig.9 is a tabie^giving for the 26-symbol English source the correspondence 
between the source symbol and its probability ; 

- Figs. 10 and 11 are the two parts of a single flowchart illustrating an 
implementation of an improvement of the conventional method illustrated in Figs.2 to 4 ; 
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- Fig.12 is another table giving various VLEC codes for the same 26 symbol 
English source as considered in the tables of Figs.6 and 8 and using the GAS ; 

- Fig, 13 is another table giving various VLEC codes for the same source as in 
Fig.12 and using both the GAS previously mentioned and the building method according to 
the improvement illustrated in Figs.10 and 11 ; 

- Fig.14 shows the modification of the part of flowchart of Fig.ll according to 
another improvement of the conventional method illustrated in Figs.2 to 4 ; 

" - Fig^lS showsTwitfi respect fotfies^ bylhe association" 

of Figs.10 and 14, a modification of the part of flowchart of Fig.10 when the method 
according to the invention is carried out (only the lower modified part, with respect to the 
original Fig.10, is shown) ; 

- Fig.16 is a table illustrating the results obtained for the 26-symbol English 
source when the previous method of Hg.14 ("Ls optimization" method) is carried out ; 

- Fig.17 is a table illustrating the results obtained for the 26-symbol English 
source when the method according to the invention is carried out. 

DETAILED DESCRIPTION OF THE INVENTION 

Considering the results of some simulations made on the basis of the classic 
Heuristic method or the modified ones ("noHole optimization" method, "Ls optimization" 
method), it appears that all distributions of number of codewords for the best VLEC codes 
found in said simulations have a similar aspect : the number Nc(L) of codewords at length 
L versus the codeword length L is a curve generally exhibiting a bell shape. It means that, 
up until a given length Lm (with m for : middle), the number of codewords increases with 
the length, whereas, after said length Lm, the number of codewords decreases. 

With respect to the single flowchart formed by the association of Fig.10 and 
Fig.14, the previous remark is therefore exploited by means of a modification of the part of 
flowchart of Fig.10 : the method according to the invention is illustrated in Fig.15, which 
must be considered together with the remaining upper part of Fig.10 and with said Fig.14 
in order to form the new single flowchart corresponding to the implementation of said 
method. This method, called "Lm optimization", is an adaptation of the "Ls optimization" 
method, and this adaptation is done by means of an introduction of the technical measures 
now described. 

According to said measures, three circuits (test circuits 71 and 72, and a 
computing circuit 73) are added to the solution shown in Fig.10, between the circuits 33 et 
34 : the method is indeed implemented after the circuit 33, where the set W is in 
accordance with all the different distances, and before the circuit 34, that performs the 
addition of the words of W to the code C Between these two circuits, additional tests are 
carried out according to the invention. 
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First, one assumes that the word length Lw In W Is lower than Lm. To respect 
the allure of the curve, the size of the set W must not be smaller than the last group one. 
If said size is smaller than the size of the last group (reply YES to the test 71), the steps 21 
to 24 and 31 to 35 must be repeated, as explained above when describing the previous 
implementations. If said size of W is greater than the size of the last group or if the word 
length Lw is greater than Lm (reply NO to the test 71), the test 72 is performed. 

If the word length Lw is now greater than Lm and lower than L(max) - 2 , the 
W size must be lower than the last group one. If it is not true (reply YES to the test 72), 
the number of words in W is too high and it must be set not more than the last group size, 
which is done in the circuit 73. Then the words in W are added to C (circuit 34), as 
previously with the "Heuristic", "noHole optimization" or "Ls optimization" method. If it is 
true (W size lower than the last group one : reply NO to the test 72), nothing is done 
before adding the words in W to the code C (circuit 73). To give some freedom to the 
method, which will allow to find the code allure with little oscillation, the allure constraint is 
tested not globally but locally, i.e. the inclination is verified for only two lengths : the length 
Lw of W and the length L(last_ group) of the last group. 

By comparing the results obtained with the "Ls optimization" method (table of 
Fig.16) and the ones obtained with the present "Lm optimization" method, illustrated in the 
table of Rg.17, it appears that the "Lm optimization" gives the same best code with only a 
small gain of time. It can be explained by the fact that sources with only a few number of 
codewords are already quickly dealt with the "Ls optimization" method, the "Lm 
optimizatton w one showing its interest especially when there is a higher number of 
codewords. 
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1. A method of building a variable length error code, said method comprising the 

steps of : 

(1) Initializing the needed parameters : minimum and maximum length of 
codewords L x and L^ax respectively, free distance d^e between each codeword (said 
distance d^e being for a VLEC code C the minimum Hamming distance in the set of all 
arbitrary extended codes), required number of codewords S ; 

(2) generating a fixed length code C of length L t and minimal distance b m m, 

with bmm = min {b k ; k = 1, 2, , R}, b k = the distance associated to the codeword 

length U of code C and defined as the minimum Hamming distance between all codewords 
of C with length I*, and R = the number of different codeword lengths in C, said 
generating step creating a set W of n-bit long words distent of d ; 

(3) storing In the set W all the possible U - tuples distant of dmi n from the 
codewords of C (said distance d mln for a VLEC code C being the minimum value of ail the 
diverging distances between all possible couples of different-length codewords of C), and, 
if said set W is not empty, affixing at the end of all words one extra bit, said storing step 
replacing the set W by a new one having twice more words than the previous one and the 
length of each one of these words being L x + 1 ; 

(4) deleting all the words of the set W that do not satisfy the c^ distance 
with all codewords of C, said distance Cmm being the minimum converging distance of the 
codeC; 

(5) in the case where no word is found or the maximum number of bits is 
reached, reducing the constraint of distance for finding more words ; 

(6) controlling that all words of the set W are distant of bmi n , the found words 
being then added to the code C ; 

(7) if the required number of codewords has not been reached, repeating the 
steps (1) to (6) until the method finds either no further possibility to continue or the 
required number of codewords ; 

(8) if the number of codewords of C is greater than S, calculating, on the basis 
of the structure of the VLEC code, the average length AL obtained by weighting each 
codeword length with the probability of the source, said AL becoming the AUnin if it is lower 
than ALmin, with AUm = the minimum value of AL, and the corresponding code structure 
being kept In memory ; 

said building method being moreover characterized in that, considering that all distributions 
of number of codewords for the best VLEC codes have a similar curve allure of a bell shape 
type, it is defined an optimal length value Lm until which the number of codewords 
increases with their length, whereas it decreases after said value Lm, said definition 
allowing to apply the so-called "Ls optimization" method with avoiding the edges of the 
curve and to work locally. 
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2. A method of building a variable length error code, said method comprising the 

steps of : 

(1) initializing the needed parameters : minimum and maximum length of 
codewords U and Uax respectively, free distance d^ between each codeword (said 
distance dfree being for a VLEC code C the minimum Hamming distance in the set of all 
arbitrary extended codes), required number of codewords S ; 

(2) generating a fixed length code C of length U and minimal distance 
with b^ "= m!n "{b k ; k =T, 2, .„.;., R},"b k - the distance associated to the codeword 
length L k of code C and defined as the minimum Hamming distance between all codewords 
of C with length L*, and R =* the number of different codeword lengths in C, said 
generating step creating a set W of n-bit long words distant of d ; 

(3) storing in the set W all the possible L x - tuples distant of d min from the 
codewords of C (said distance d mIn for a VLEC code C being the minimum value of all the 
diverging distances between all possible couples of different-length codewords of C), and, 
if said set W is not empty, affixing at the end of all words one extra bit, said storing step 
replacing the set W by a new one having twice more words than the previous one and the 
length of each one of these words being U + 1 ; 

(4) deleting all the words of the set W that do not satisfy the c^m distance 
with all codewords of C, said distance c^ being the minimum converging distance of the 
code C ; 

(5) in the case where no word is found or the maximum number of bits is 
reached, reducing the constraint of distance for finding more words ; 

(6) controlling that ail words of the set W are distant of b m m, the found words 
being then added to the code C ; 

(7) if the required number of codewords has not been reached, repeating the 
steps (1) to (6) until the method finds either no further possibility to continue or the 
required number of codewords ; 

(8) if the number of codewords of C is greater than S, calculating, on the basis 
of the structure of the VLEC code, the average length AL obtained by weighting each 
codeword length with the probability of the source, said AL becoming the AUin If It is lower 
than ALmm, with AUm - the minimum value of AL, and the corresponding code structure 
being kept in memory ; 

said building method being moreover characterized in that the deletion is realized not only 
in the last obtained group but also in the group of a given length value, In order to go back 
very quickly to smaller lengths, and, considering that all distributions of number of 
codewords for the best VLEC codes have a similar curve allure of a bell shape type, it is 
defined an optimal length value Lm until which the number of codewords increases with 
their length, whereas it decreases after said value Lm, said definition allowing to apply the 
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so-called "Ls optimization" method with avoiding the edges of the curve and to work 
locally. 

3. A VLEC code building method according to anyone of claims 1 and 2, in which 
the optimal value for Lm is Lm = Ls +1. 

4. A device for carrying out a variable length error code building method 
according to anyone of claims 1 to 3. 
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Abstract 

The Invention relates to a variable-length error-correcting (VLEC) code 
technique, in which the main steps are : defining all the needed parameters, generating a 
code having a fixed length LI, storing in a set W thus obtained all the possible Ll-tuples 

5 distant of the minimum diverging distance d[min] from the codewords (one extra-bit being 

affixed at the end of all words if the new set W thus obtained is not empty), deleting all 
words of W that do not satisfy a distance criterion with all codewords, and verifying that all 
words of the final set W satisfy another distance criterion. Assuming that all distributions of 
number of codewords for the best VLEC codes have a similar curve allure of a bell shape 

10 type, it is then proposed, according to the invention, to define an optimal length value Lm 

until which the number of codewords increases with their length, whereas it decreases 
after said value Lm. According to this new construction method, called Lm optimization, it 
is possible to apply the so-called "Ls optimization' 1 method with avoiding the edges of the 
curve and to work locally. 
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